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Abstract 

We present three new complexity results for classes of planning problems with simple 
causal graphs. First, we describe a polynomial-time algorithm that uses macros to gen- 
' erate plans for the class 3S of planning problems with binary state variables and acyclic 

O . causal graphs. This implies that plan generation may be tractable even when a planning 

problem has an exponentially long minimal solution. We also prove that the problem of 
plan existence for planning problems with multi- valued variables and chain causal graphs 
^ ' is NP-hard. Finally, we show that plan existence for planning problems with binary state 

, variables and polytree causal graphs is NP-complete. 

o 

1. Introduction 

Planning is an area of research in artificial intelligence that aims to achieve autonomous 
control of complex systems. Formally, the planning problem is to obtain a sequence of 
transformations for moving a system from an initial state to a goal state, given a description 
of possible transformations. Planning algorithms have been successfully used in a variety 
of applications, including robotics, process planning, information gathering, autonomous 
agents and spacecraft mission control. Research in planning has seen significant progress 
during the last ten years, in part due to the establishment of the International Planning 
Competition. 

An important aspect of research in planning is to classify the complexity of solving 
planning problems. Being able to classify a planning problem according to complexity 
makes it possible to select the right tool for solving it. Researchers usually distinguish 
between two problems: plan generation, the problem of generating a sequence of transfor- 
mations for achieving the goal, and plan existence, the problem of determining whether 
such a sequence exists. If the original STRIPS formalism is used, plan existence is unde- 
cidable in the first-order case (Chapman, 1987) and PSPACE-complete in the propositional 
case (Bylander, 1994). Using PDDL, the representation language used at the International 
Planning Competition, plan existence is EXPSPACE-complete (Erol, Nau, & Subrahma- 
nian, 1995). However, planning problems usually exhibit structure that makes them much 
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easier to solve. Helmert (2003) showed that many of the benchmark problems used at the 
International Planning Competition are in fact in P or NP. 

A common type of structure that researchers have used to characterize planning prob- 
lems is the so called causal graph (Knoblock, 1994). The causal graph of a planning 
problem is a graph that captures the degree of independence among the state variables 
of the problem, and is easily constructed given a description of the problem transforma- 
tions. The independence between state variables can be exploited to devise algorithms for 
efficiently solving the planning problem. The causal graph has been used as a tool for 
describing tractable subclasses of planning problems (Brafman & Domshlak, 2003; Jonsson 
& Backstrom, 1998; Williams & Nayak, 1997), for decomposing planning problems into 
smaller problems (Brafman & Domshlak, 2006; Jonsson, 2007; Knoblock, 1994), and as the 
basis for domain- independent heuristics that guide the search for a valid plan (Helmert, 
2006). 

In the present work we explore the computational complexity of solving planning prob- 
lems with simple causal graphs. We present new results for three classes of planning prob- 
lems studied in the literature: the class 3S (Jonsson & Backstrom, 1998), the class C n 
(Domshlak & Dinitz, 2001), and the class of planning problems with polytree causal graphs 
(Brafman & Domshlak, 2003). In brief, we show that plan generation for instances of the 
first class can be solved in polynomial time using macros, but that plan existence is not 
solvable in polynomial time for the remaining two classes, unless P = NP. This work first 
appeared in a conference paper (Gimenez & Jonsson, 2007); the current paper provides 
more detail and additional insights as well as new sections on plan length and CP-nets. 

A planning problem belongs to the class 3S if its causal graph is acyclic and all state 
variables are either static, symmetrically reversible or splitting (see Section 3 for a pre- 
cise definition of these terms). The class 3S was introduced and studied by Jonsson and 
Backstrom (1998) as an example of a class for which plan existence is easy (there exists a 
polynomial-time algorithm that determines whether or not a particular planning problem 
of that class is solvable) but plan generation is hard (there exists no polynomial-time algo- 
rithm that generates a valid plan for every planning problem of the class). More precisely, 
Jonsson and Backstrom showed that there are planning problems of the class 3S for which 
every valid plan is exponentially long. This clearly prevents the existence of an efficient 
plan generation algorithm. 

Our first contribution is to show that plan generation for 3 S is in fact easy if we are 
allowed to express a valid plan using macros. A macro is simply a sequence of operators and 
other macros. We present a polynomial-time algorithm that produces valid plans of this 
form for planning problems of the class 3S. Namely, our algorithm outputs in polynomial 
time a system of macros that, when executed, produce the actual valid plan for the planning 
problem instance. The algorithm is sound and complete, that is, it generates a valid plan 
if and only if one exists. We contrast our algorithm to the incremental algorithm proposed 
by Jonsson and Backstrom (1998), which is polynomial in the size of the output. 

We also investigate the complexity of the class C n of planning problems with multi- 
valued state variables and chain causal graphs. In other words, the causal graph is just a 
directed path. Domshlak and Dinitz (2001) showed that there are solvable instances of this 
class that require exponentially long plans. However, as it is the case with the class 3S, 
there could exist an efficient procedure for generating valid plans for C n instances using 



320 



Complexity of Planning Problems 



macros or some other novel idea. We show that plan existence in C n is NP-hard, hence 
ruling out that such an efficient procedure exists, unless P = NP. 

We also prove that plan existence for planning problems whose causal graph is a poly- 
tree (i.e., the underlying undirected graph is acyclic) is NP-complete, even if we restrict to 
problems with binary variables. This result closes the complexity gap that appears in Braf- 
man and Domshlak (2003) regarding planning problems with binary variables. The authors 
show that plan existence is NP-complete for planning problems with singly connected causal 
graphs, and that plan generation is polynomial for planning problems with polytree causal 
graphs of bounded indegree. We use the same reduction to prove that a similar problem on 
polytree CP-nets (Boutilier, Brafman, Domshlak, Hoos, & Poole, 2004) is NP-complete. 

1.1 Related Work 

Several researchers have used the causal graph to devise algorithms for solving planning 
problems or to study the complexity of planning problems. Knoblock (1994) used the 
causal graph to decompose a planning problem into a hierarchy of increasingly abstract 
problems. Under certain conditions, solving the hierarchy of abstract problems is easier than 
solving the original problem. Williams and Nayak (1997) introduced several restrictions on 
planning problems to ensure tractability, one of which is that the causal graph should be 
acyclic. Jonsson and Backstrom (1998) defined the class 3S of planning problems, which 
also requires the causal graphs to be acyclic, and showed that plan existence is polynomial 
for this class. 

Domshlak and Dinitz (2001) analyzed the complexity of several classes of planning 
problems with acyclic causal graphs. Brafman and Domshlak (2003) designed a polynomial- 
time algorithm for solving planning problems with binary state variables and acyclic causal 
graph of bounded indegree. Brafman and Domshlak (2006) identified conditions under 
which it is possible to factorize a planning problem into several subproblems and solve 
the subproblems independently. They claimed that a planning problem is suitable for 
factorization if its causal graph has bounded tree-width. 

The idea of using macros in planning is almost as old as planning itself (Fikes & Nilsson, 
1971). Minton (1985) developed an algorithm that measures the utility of plan fragments 
and stores them as macros if they are deemed useful. Korf (1987) showed that macros can 
exponentially reduce the search space size of a planning problem if chosen carefully. Vidal 
(2004) used relaxed plans generated while computing heuristics to produce macros that 
contribute to the solution of planning problems. Macro-FF (Botea, Enzenberger, Miiller, 
&l Schaeffer, 2005), an algorithm that identifies and caches macros, competed at the fourth 
International Planning Competition. The authors showed how macros can help reduce the 
search effort necessary to generate a valid plan. 

Jonsson (2007) described an algorithm that uses macros to generate plans for planning 
problems with tree-reducible causal graphs. There exist planning problems for which the 
algorithm can generate exponentially long solutions in polynomial time, just like our algo- 
rithm for 3S. Unlike ours, the algorithm can handle multi-valued variables, which enables it 
to solve problems such as Towers of Hanoi. However, not all planning problems in 3S have 
tree-reducible causal graphs, so the algorithm cannot be used to show that plan generation 
for 3S is polynomial. 
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1.2 Hardness and Plan Length 

A contribution of this paper is to show that plan generation may be polynomial even when 
planning problems have exponential length minimal solutions, provided that solutions may 
be expressed using a concise notation such as macros. We motivate this result below and 
discuss the consequences. Previously, it has been thought that plan generation for planning 
problems with exponential length minimal solutions is harder than NP, since it is not 
known whether problems in NP are intractable, but it is certain that we cannot generate 
exponential length output in polynomial time. 

However, for a planning problem with exponential length minimal solution, it is not 
clear if plan generation is inherently hard, or if the difficulty just lies in the fact that the 
plan is long. Consider the two functional problems 

h(F) = w(l,2\ F \), 

f 2 (F) = w(t(F),2\ F \), 

where F is a 3-CNF formula, \F\ is the number of clauses of F, w(a, k) is a word containing 
k copies of the symbol a, and t(F) is 1 if F is satisfiable (i.e., F is in 3Sat), and if it is 
not. In both cases, the problem consists in generating the correct word. Observe that both 
fi and ji are provably intractable, since their output is exponential in the size of the input. 

Nevertheless, it is intuitive to regard problem j\ as easier than problem f2- One way 
to formalize this intuition is to allow programs to produce the output in some succinct 
notation. For instance, if we allow programs to write "w(<r,/c)" instead of a string containing 
k copies of the symbol a, then problem f\ becomes polynomial, but problem f% does not 
(unless P = NP). 

We wanted to investigate the following question: regarding the class 3S, is plan genera- 
tion intractable because solution plans are long, like fi, or because the problem is intrinsi- 
cally hard, like j-p. The answer is that plan generation for 3S can be solved in polynomial 
time, provided that one is allowed to give the solution in terms of macros, where a macro 
is a simple substitution scheme: a sequence of operators and/or other macros. To back up 
this claim, we present an algorithm that solves plan generation for 3S in polynomial time. 

Other researchers have argued intractability using the fact that plans may have expo- 
nential length. Domshlak and Dinitz (2001) proved complexity results for several classes of 
planning problems with multi- valued state variables and simple causal graphs. They argued 
that the class C n of planning problems with chain causal graphs is intractable since plans 
may have exponential length. Brafman and Domshlak (2003) stated that plan generation 
for STRIPS planning problems with unary operators and acyclic causal graphs is intractable 
using the same reasoning. Our new result puts in question the argument used to prove the 
hardness of these problems. For this reason, we analyze the complexity of these problems 
and prove that they are hard by showing that the plan existence problem is NP-hard. 

2. Notation 

Let V be a set of state variables, and let D(v) be the finite domain of state variable v £ V. 
We define a state s as a function on V that maps each state variable v € V to a value 
s(v) £ D(v) in its domain. A partial state p is a function on a subset V p C V of state 
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variables that maps each state variable v G V p to p(v) G D(v). For a subset C Q V oi 
state variables, p \ C is the partial state obtained by restricting the domain of p to V p n C. 
Sometimes we use the notation (wi = xi, . . . , Vk = Xk) to denote a partial state p defined by 
V p = {vi, . . . , Wfc} and p(yi) = Xi for each Vi G V p . We write p(v) =_L to denote that v £ V p . 

Two partial states p and g match, which we denote pVq, if and only if p | V q = q \ V p , 
i.e., for each v G VpflVg, p(t>) = (/(w). We define a replacement operator © such that if q and 
r are two partial states, p = q © r is the partial state defined by V p = V q U VJ., p(t> ) = r(v) 
for each v G F r , and = for each f G — F r . Note that, in general, p @ q ^ q @ p. 
A partial state p subsumes a partial state q, which we denote p C if and only if pVg and 
V p Q V q . We remark that if p C g and r C s, it follows that p © r C g © s. The difference 
between two partial states q and r, which we denote q — r, is the partial state p defined by 
V p = {v G V q I 7^ r(w)} and p(t>) = q(v) for each t> G V p . 

A planning problem is a tuple P = (V,init,goal,A), where V is the set of variables, 
init is an initial state, goal is a partial goal state, and A is a set of operators. An operator 
a = (pre(a); post(a)) G A consists of a partial state pre(a) called the pre-condition and a 
partial state post(a) called the post- condition. Operator a is applicable in any state s such 
that sS/pre(a), and applying operator a in state s results in the new state s (Bpost(a). A 
valid plan II for P is a sequence of operators that are sequentially applicable in state init 
such that the resulting state s' satisfies s's/goal. 

The causal graph of a planning problem P is a directed graph (V, E) with state variables 
as nodes. There is an edge (u, v) G E if and only if u ^ v and there exists an operator 
a G A such that u G V pre(a) U V post{a) and v G V post(a) . 

3. The Class 3S 

Jonsson and Backstrom (1998) introduced the class 3S of planning problems to study the 
relative complexity of plan existence and plan generation. In this section, we introduce 
additional notation needed to describe the class 3S and illustrate some of the properties of 
3S planning problems. We begin by defining the class 3S: 

Definition 3.1 A planning problem P belongs to the class 3S if its causal graph is acyclic 
and each state variable v G V is binary and either static, symmetrically reversible, or 
splitting. 

Below, we provide formal definitions of static, symmetrically reversible and splitting. 
Note that the fact that the causal graph is acyclic implies that operators are unary, i.e., for 
each operator a G A, \V post ^ \ = 1. Without loss of generality, we assume that 3S planning 
problems are in normal form, by which we mean the following: 

• For each state variable v, D(v) = {0, 1} and init(v) = 0. 

• post(a) = (v = x), x G {0, 1}, implies that pre(a)(v) = 1 — x. 

To satisfy the first condition, we can relabel the values of D{v) in the initial and goal 
states as well as in the pre- and post-conditions of operators. To satisfy the second condition, 
for any operator a with post(a) = (v = x) and pre(a)(v) ^ 1 — x, we either remove it if 
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Figure 1: Causal graph with splitting variable partitions for (a) v, (b) w. 

pre(a)(v) = x, or we let pre(a)(v) = 1 — x if previously undefined. The resulting planning 
problem is in normal form and is equivalent to the original one. This process can be done 
in time 0(|A||y|). 

The following definitions describe the three categories of state variables in 3S: 
Definition 3.2 A state variable v G V is static if and only if one of the following holds: 

1. There does not exist a £ A such that post(a)(v) = 1, 

2. goal(v) = and there does not exist a £ A such that post(a)(v) = 0. 

Definition 3.3 A state variable v (zV is reversible if and only if for each a G A such that 
post{a) = (v = x), there exists a' G A such that post(a') = (v = 1 — x). In addition, v is 
symmetrically reversible if pre(a') | (V — {v}) = pre(a) \ (V — {v}). 

From the above definitions it follows that the value of a static state variable cannot or 
must not change, whereas the value of a symmetrically reversible state variable can change 
freely, as long as it is possible to satisfy the pre-conditions of operators that change its 
value. The third category of state variables is splitting. Informally, a splitting state variable 
v splits the causal graph into three disjoint subgraphs, one which depends on the value 
v = 1, one which depends on v = 0, and one which is independent of v. However, the 
precise definition is more involved, so we need some additional notation. 

For v G V, let Qq be the subset of state variables, different from v, whose value is 
changed by some operator that has v = as a pre-condition. Formally, Qq = {u G V — {v} \ 
3a G A s.t. pre(a)(v) = A u G V pos t(a)}- Define Q\ in the same way for v = 1. Let 
Gq = (V, £q) be the subgraph of (V, E) whose edges exclude those between v and Qq — Q\. 
Formally, E% = E - {(v, w) \ w G Qq A w (£ Q\}. Finally, let V V C V be the subset of state 
variables that are weakly connected to some state variable of Qq in the graph Gq. Define 
Vi in the same way for v = 1. 

Definition 3.4 A state variable v G V is splitting if and only if V V and are disjoint. 

Figure 1 illustrates the causal graph of a planning problem with two splitting state 
variables, v and w. The edge label v = indicates that there are operators for changing 
the value of u that have v = as a pre-condition. In other words, Qq = {u,w}, the graph 
Gq = (V, Eq) excludes the two edges labeled v = 0, and V V includes all state state variables, 
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since v is weakly connected to u and w connects to the remaining state variables. The set 
Q\ is empty since there are no operators for changing the value of a state variable other 
than v with v = 1 as a pre-condition. Consequently, V* is empty as well. Figure 1(a) shows 
the resulting partition for v. 

In the case of w, Qq = {s}, Gq = (V,Eq) excludes the edge labeled w = 0, and 
Vq" = {s}, since no other state variable is connected to s when the edge w = is removed. 
Likewise, V™ = {<}. We use V™ = V — V™ — V™ to denote the set of remaining state 
variables that belong neither to V™ nor to V™ . Figure 1(b) shows the resulting partition 
for w. 

Lemma 3.5 For any splitting state variable v, if the two sets V V and Vf are non-empty, 
v belongs neither to V V nor to V£ . 

Proof By contradiction. Assume that v belongs to Vq. Then v is weakly connected to 
some state variable of Qq in the graph Gq = (V,Eq). But since Eq does not exclude edges 
between v and Q\ , any state variable in Q\ is weakly connected to the same state variable of 
Qq in Gq. Consequently, state variables in Q\ belong to both V V and V", which contradicts 
that v is splitting. The same reasoning holds to show that v does not belong to V" . 

Lemma 3.6 The value of a splitting state variable never needs to change more than twice 
on a valid plan. 

Proof Assume II is a valid plan that changes the value of a splitting state variable v at 
least three times. We show that we can reorder the operators of II in such a way that the 
value of v does not need to change more than twice. We need to address three cases: v 
belongs to V V (cf. Figure 1(a)), v belongs to V", or v belongs to V* (cf. Figure 1(b)). 

If v belongs to V V , it follows from Lemma 3.5 that V[ is empty. Consequently, no 
operator in the plan requires v = 1 as a pre-condition. Thus, we can safely remove all 
operators in IT that change the value of v, except possibly the last, which is needed in case 
goal(v) = 1. If v belongs to V±, it follows from Lemma 3.5 that Vq is empty. Thus, no 
operator in the plan requires v = as a pre-condition. The first operator in II that changes 
the value of v is necessary to set v to 1. After that, we can safely remove all operators in LT 
that change the value of v, except the last in case goal{v) = 0. In both cases the resulting 
plan contains at most two operators changing the value of v. 

If v belongs to V£, then the only edges between Vq, V", and V? are those from v G V* 
to Qq C V v and Q\ C V£. Let LTo, 111 , and LT* be the subsequences of operators in II that 
affect state variables in Vq ,V^ , and V?, respectively. Write II* = (n^a^n'/), where a\ is 
the last operator in II* that changes the value of v from to 1. We claim that the reordering 
(Hq, n^, a\, IIi, n^') of plan II is still valid. Indeed, the operators of Ilo only require v = 0, 
which holds in the initial state, and the operators of LTi only require v = 1, which holds 
due to the operator a\. Note that all operators changing the value of v in II* can be safely 
removed since the value v = 1 is never needed as a pre-condition to change the value of a 
state variable in V? ■ The result is a valid plan that changes the value of v at most twice 
(its value may be reset to by IT*). 
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Variable Operators V£* 
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Table 1: Operators and the sets Vq* and for the example planning problem. 




Figure 2: Causal graph of the example planning problem. 



The previous lemma, which holds for splitting state variables in general, provides some 
additional insight into how to solve a planning problem with a splitting state variable v. 
First, try to achieve the goal state for state variables in Vq while the value of v is 0, as in 
the initial state. Then, set the value of v to 1 and try to achieve the goal state for state 
variables in V® . Finally, if goal{v) = 0, reset the value of v to 0. 

3.1 Example 

We illustrate the class 3S using an example planning problem. The set of state variables 
is V = {vi, . . . ,v$}. Since the planning problem is in normal form, the initial state is 
init(vi) = for each v% € V. The goal state is defined by goal = (v$ = l,vg = 1), and 
the operators in A are listed in Table 1. Figure 2 shows the causal graph (V,E) of the 
planning problem. From the operators it is easy to verify that V4 is static and that v\ 
and vq are symmetrically reversible. For the planning problem to be in 3S, the remaining 
state variables have to be splitting. Table 1 lists the two sets and for each state 
variable Vi € V to show that indeed, n V"' = for each of the state variables in the set 
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4. Plan Generation for 3S 

In this section, we present a polynomial-time algorithm for plan generation in 3S. The 
algorithm produces a solution to any instance of 3S in the form of a system of macros. The 
idea is to construct unary macros that each change the value of a single state variable. The 
macros may change the values of other state variables during execution, but always reset 
them before terminating. Once the macros have been generated, the goal can be achieved 
one state variable at a time. We show that the algorithm generates a valid plan if and only 
if one exists. 

We begin by defining macros as we use them in the paper. Next, we describe the 
algorithm in pseudo-code (Figures 3, 4, and 5) and prove its correctness. To facilitate 
reading we have moved a straightforward but involving proof to the appendix. Following 
the description of the algorithm we analyze the complexity of all steps involved. In what 
follows, we assume that 3S planning problems are in normal form as defined in the previous 
section. 

4.1 Macros 

A macro-operator, or macro for short, is an ordered sequence of operators viewed as a unit. 
Each operator in the sequence has to respect the pre-conditions of operators that follow 
it, so that no pre-condition of any operator in the sequence is violated. Applying a macro 
is equivalent to applying all operators in the sequence in the given order. Semantically, 
a macro is equivalent to a standard operator in that it has a pre-condition and a post- 
condition, unambiguously induced by the pre- and post-conditions of the operators in its 
sequence. 

Since macros are functionally operators, the operator sequence associated with a macro 
can include other macros, as long as this does not create a circular definition. Consequently, 
it is possible to create hierarchies of macros in which the operator sequences of macros on 
one level include macros on the level below. The solution to a planning problem can itself 
be viewed as a macro which sits at the top of the hierarchy. 

To define macros we first introduce the concept of induced pre- and post-conditions of 
operator sequences. If II = (ai, . . . , a^) is an operator sequence, we write Ilj, 1 < i < k, to 
denote the subsequence (ai, . . . , Oj). 

Definition 4.1 An operator sequence II = (ai, . . . ,a&) induces a pre-condition preiji) = 
pre(a,k) © • • • ®pre{a\) and a post- condition post(U) = post{a\) © • • • (&post{ak) . In addition, 
the operator sequence is well-defined if and only if (pre(Jli-i)®post(Jli-i))S7pre(ai) for each 
1< i < k. 

In what follows, we assume that P = (V,init, goal, A) is a planning problem such that 
Vpost(a) Q ^pre(a) f° r each operator a € A, and that II = (a±, . . . , a^) is an operator sequence. 

Lemma 4.2 For each planning problem P of this type and each H, V post (j^ C V^my 

Proof A direct consequence of the definitions V^, re (n) = V^ r e(ai)l> • ■UV pre ( (1( .) and V post ^ = 
Vpost(ai) U • • • U V post ( ak y 
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Lemma 4.3 The operator sequence II is applicable in state s if and only ifU is well-defined 
and sVpre(II). The state Sk resulting from the application of II to s is s k = s (Bpost(IL). 

Proof By induction on k. The result clearly holds for k = 1. For k > 1, note that 
preijl) = pre(ak) (B pre(Hk-i) , post(U) = post(Hk-i) © post(ak), and II is well-defined if 
and only if Hk-i is well-defined and (prejl^-i) © post(Hk-i))Vpre(ak)- 

By hypothesis of induction the state s^-i resulting from the application of Hk-i to s is 
= s © post(Tlk-i)- It follows that = s^-i © post(ak) = s © post(IT). 

Assume IT is applicable in state s. This means that ILj_i is applicable in s and that 
is applicable in s^-i = s (& post(Hk-i) ■ By hypothesis of induction, the former implies that 
sVpre(Uk~i) and II^-i is well-defined, and the latter that (s © post(Jlk-i)) Vpre{ak). This 
last condition implies that (pre(Tlk-i) ®post(Ylk-i))Vpre(ak) if we use that pre(Tlk-i) E s, 
which is a consequence of sVpre(Tlk~i) and s being a total state. Finally, we deduce 
sV(pre(afc) ffipre(IIfc_ 1 )) from sS7pre(Uk-i) and (s © posi(IL;_i))Vpre(afc), by using that 
^post(n fc _!) ^= ^pre(n fe _ 1 )- It follows that II is well-defined and that sVpre(H). 

Conversely, assume that II is well-defined and sVpre(n). This implies that IIfc_i is 
well-defined and sVpre(Uk-i), so by hypothesis of induction, Hk-i is applicable in state s. 
It remains to show that a& is applicable in state s^-i, that is, (s © post(n,t_i)) Vpre(afc). 
From (pre(IIfe_i) ©post(IL;_i)) Vpre(afe) it follows that posi(ITfc_i) Vpre(afc). The fact that 
sV(pre(a fe ) ®pre{Tl k _ 1 )) and ^^(n^,) C ^(n^^ completes the proof. 

Since macros have induced pre- and post-conditions, Lemmas 4.2 and 4.3 trivially extend 
to the case for which the operator sequence II includes macros. Now we are ready to 
introduce our definition of macros: 

Definition 4.4 A macro m is a sequence 11 = (cti, . . . , a^) of operators and other macros 
that induces a pre-condition pre{m) = preijl) and a post- condition post(m) = post(U) — 
pre(n). The macro is well-defined if and only if no circular definitions occur and U is 
well-defined. 

To make macros consistent with standard operators, the induced post-condition should 
only include state variables whose values are indeed changed by the macro, which is achieved 
by computing the difference between post(H) and pre(U). In particular, it holds that for a 
3S planning problem in normal form, derived macros satisfy the second condition of normal 
form, namely that post(m) = (v = x), x G {0, 1}, implies pre{m){v) = 1 — x. 

Definition 4.5 Let Anc v be the set of ancestors of a state variable v in a 3S planning 
problem. We define the partial state pre" on V pre v = Anc v as 

1. pre v {u) = 1 if u G And" is splitting and v £ V™, 

2. pre v (u) = otherwise. 

Definition 4.6 A macro m is a 3S-macro if it is well-defined and, forx £ {0, 1}, post(m) = 
(v = x) and pre(m) C pre" © (v = 1 — x). 
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Table 2: Macros generated by the algorithm in the example planning problem. 



The algorithm we present only generates 3S-macros. In fact, it generates at most one 
macro m = m v x with post(m) = (v = x) for each state variable v and value x G {0, 1}. To 
illustrate the idea of 3S-macros and give a flavor of the algorithm, Table 2 lists the macros 
generated by the algorithm in the example 3S planning problem from the previous section. 

We claim that each macro is a 3S-macro. For example, the operator sequence (a^ 6 ) 
induces a pre-condition (v% = 1,vq = 0) and a post-condition (v^ = 1,vq = 0) © (v^ = 1) = 
(t>3 = 1, vq = 1). Thus, the macro m^ 6 induces a pre-condition pre{m"^) = {v% = 1,vq = 0) 
and a post-condition post(ml 6 ) = (v^ = 1,vq = 1) — (v^ = 1,vq = 0) = (vq = 1). Since i>2 
and are splitting and since vq G V^ 2 and vq G V^ 3 , it follows that pre" 6 © (vq = 0) = 
(vi =0,V2 = 1,^3 = 1,^6 = 0), so pre(rrii 6 ) = (vs = 1,vq = 0) C pre" 6 © (vq = 0). 

The macros can be combined to produce a solution to the planning problem. The idea 
is to identify each state variable v such that goal{v) = 1 and append the macro m\ to the 
solution plan. In the example, this results in the operator sequence (m^ 5 ,m^ 8 ). However, 
the pre-condition of m^ 8 specifies = 1 and vj = 1, which makes it necessary to insert m"^ 
and ml 7 before m^ 8 . In addition, the pre-condition of m"^ specifies V2 = 1, which makes 
it necessary to insert m^ 2 before m^ 3 , resulting in the final plan (m^ 5 , m^ 2 , m\ 3 , m\ 7 , m^ 8 ) . 
Note that the order of the macros matter; m^ 5 requires V3 to be while m^ 8 requires 
«3 to be 1. For a splitting state variable v, the goal state should be achieved for state 
variables in V V before the value of v is set to 1. We can expand the solution plan so that 
it consists solely of operators in A. In our example, this results in the operator sequence 
(a\ 5 , a"i , a^ 2 , Oq 1 , a^ 3 , a\ 6 , a^ 7 , a V Q , a^ 8 ) . In this case, the algorithm generates an optimal 
plan, although this is not true in general. 

4.2 Description of the Algorithm 

We proceed by providing a detailed description of the algorithm for plan generation in 3S. 
We first describe the subroutine for generating a unary macro that sets the value of a state 
variable v to x. This algorithm, which we call GenerateMacro, is described in Figure 3. 
The algorithm takes as input a planning problem P, a state variable v, a value x (either 
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1 function GenerateMacro(P, v, x, M) 

2 for each a G A such that post(a)(v) = x do 

3 S <- 5i <- () 

4 satisfy <— true 

5 £7 <- {u G F pre((I ) - {v} | pre(a)(u) = 1} 

6 for each u € f/ in increasing topological order do 

7 if u is static or mf ^ M then 

8 satisfy ^— false 

9 else if u is not splitting and mj} G M and mf G M then 

10 S ^(S ,m%) 

11 Si (m?, Si) 

12 if satisfy then 

13 return (Si, a, So) 

14 return fail 



Figure 3: Algorithm for generating a macro that sets the value of v to x. 

or 1), and a set of macros M for u's ancestors in the causal graph. Prior to executing the 
algorithm, we perform a topological sort of the state variables. We assume that, for each 
v G V and x G {0, 1}, M contains at most one macro m v x such that post(m x ) = (v = x). In 
the algorithm, we use the notation m x G M to test whether or not M contains m v x . 

For each operator oGi that sets the value of v to x, the algorithm determines whether 
it is possible to satisfy its pre-condition pre(a) starting from the initial state. To do this, the 
algorithm finds the set U of state variables to which pre(a) assigns 1 (the values of all other 
state variables already satisfy pre(a) in the initial state). The algorithm constructs two 
sequences of operators, So and Si, by going through the state variables of U in increasing 
topological order. If S is an operator sequence, we use (S, o) as shorthand to denote an 
operator sequence of length |S| + 1 consisting of all operators of S followed by o, which can 
be either an operator or a macro. If it is possible to satisfy the pre-condition pre(a) of some 
operator a G A, the algorithm returns the macro (Si, a, So). Otherwise, it returns fail. 

Lemma 4.7 If v is symmetrically reversible and GenerateMacro^P, v, 1, M) success- 
fully generates a macro, so does GenerateMacro^P, v, 0, M). 

Proof Assume that GenerateMacro(P, v, 1, M) successfully returns the macro (Si, a, So) 
for some operator a £ A such that post(a) = 1. From the definition of symmetrically 
reversible it follows that there exists an operator a' G A such that post{a') = and 
pre(a') \ V — {v} = pre(a) \ V — {v}. Thus, the set U is identical for a and a'. As 
a consequence, the values of So, Si, and satisfy are the same after the loop, which 
means that GenerateMacro(P, v, 0, M) returns the macro (Si, a', So) for a'. Note 
that GenerateMacro(P, v, 0, M) may return another macro if it goes through the op- 
erators of A in a different order; however, it is guaranteed to successfully return a macro. 

Theorem 4.8 If the macros in M are 3S-macros and GENERATEMACRO (P, v, x, M) 
generates a macro m v x / fail, then m v x is a 3S-macro. 
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function MACRO-3S(P) 
M <- 

for each v G V in increasing topological order do 



<- GenerateMacro(P, v, 1, M) 
rajj «- GenerateMacro(P, u, 0, M) 
if / /ai/ and / /ai/ then 




else if m\ ^ fail and goal(v) / then 
M <- M U{m\} 
return GeneratePlan(P, V, M) 



Figure 4: The algorithm MACRO-3S. 



The proof of Theorem 4.8 appears in Appendix A. 

Next, we describe the algorithm for plan generation in 3S, which we call MACRO-3S. 
Figure 4 shows pseudocode for MACRO-3S. The algorithm goes through the state variables 
in increasing topological order and attempts to generate two macros for each state variable 
v, m\ and tuq. If both macros are successfully generated, they are added to the current 
set of macros M. If only m\ is generated and the goal state does not assign to v, the 
algorithm adds m\ to M. Finally, the algorithm generates a plan using the subroutine 
GeneratePlan, which we describe later. 

Lemma 4.9 Let P be a 3S planning problem and let v G V be a state variable. If there 
exists a valid plan for solving P that sets v to 1, MACRO-3S (P ) adds the macro m\ to M. 
If, in addition, the plan resets v to 0, MACRO-3S (P) adds to M. 

Proof First note that if m\ and mjj are generated, Macro-3S(P) adds them both to M. 
If m\ is generated but not t«q, Macro-3S(P) adds m\ to M unless goal(v) = 0. However, 
goal(v) = contradicts the fact that there is a valid plan for solving P that sets v to 1 
without resetting it to 0. It remains to show that GenerateMacro(P, v, 1, M) always 
generates m\ / fail and that GenerateMacro(P, v, 0, M) always generates ttt-q / fail 
if the plan resets v to 0. 

A plan for solving P that sets u to 1 has to contain an operator a £ A such that 
post(a)(v) = 1. If the plan also resets v to 0, it has to contain an operator a' G A such 
that post{a!){v) = 0. We show that GenerateMacro(P, v, 1, M) successfully generates 
m\ fail if a is the operator selected on line 2. Note that the algorithm may return another 
macro if it selects another operator before a; however, if it always generates a macro for a, 
it is guaranteed to successfully return a macro m\ / fail. The same is true for and a'. 

We prove the lemma by induction on state variables v. If v has no ancestors in the 
causal graph, the set U is empty by default. Thus, satisfy is never set to false and 
GenerateMacro(P, v, 1, M) successfully returns the macro m\ = (a) for a. If a' exists, 
GenerateMacro(P, v, 0, M) successfully returns uiq = (a') for a'. 

If v has ancestors in the causal graph, let U = {u G V pre ^ — {v} \ pre(a)(u) = 1}. 
Since the plan contains a it has to set each u G U to 1. By hypothesis of induction, 
MACRO-3S(P) adds m\ to M for each u G U. As a consequence, satisfy is never set to 
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1 function GeneratePlan(P, W, M) 

2 if \W\ = then 

3 return () 

4 v <— first variable in topological order present in W 

5 if v is splitting then 

6 ng «- Generate-Plan(P, W n (V^ - {v}), M) 

7 nf <- Generate-Plan(P, W n (VJ" - {v}), M) 

8 IIJ <- Generate-Plan(P, Wn (V -VJ - V? - {v}), M) 

9 if ng = /ail or Ilf = /aii or IIJ = fail or (#oaZ(v) = 1 and m\ £ M) then 

10 return fail 

11 else if m\ £M then return <H^, IPg, IF?) 

12 else if goal(v) = then return (II", rig, mf, Uy, mg) 

13 else return (IIJ , Ilg , m? , II? ) 

14 n <- Generate-Plan(P, IF - {v}, M) 

15 if n = fail or (goal(v) = 1 and mf ^ M) then return fail 

16 else if goal(v) = 1 then return (II, mf) 

17 else return 14 

Figure 5: Algorithm for generating the final plan 

false and thus, GenerateMacro(P, v, 1, M) successfully returns m\ for a. If a' exists, 
let W = {w € ^pre(a') ~~ { v } I pre(a')(w) = 1}. If the plan contains a', it has to set each 
w G W to 1. By hypothesis of induction, Macro-3S(P) adds to M for each w & W 
and consequently, GenerateMacro(P, v, 0, M) successfully returns rag for a'. 

Finally, we describe the subroutine GeneratePlan(P, W, M) for generating the final 
plan given a planning problem P, a set of state variables W and a set of macros M. If 
the set of state variables is empty, GeneratePlan(P, W, M) returns an empty operator 
sequence. Otherwise, it finds the state variable v € W that comes first in topological order. 
If v is splitting, the algorithm separates W into the three sets described by Vq , Vj", and 
V£ = V — Vq — Vf\ The algorithm recursively generates plans for the three sets and if 
necessary, inserts m\ between Vq and V™ in the final plan. If this is not the case, the 
algorithm recursively generates a plan for W — {v}. If goal(v) = 1 and m\, the algorithm 
appends m\ to the end of the resulting plan. 

Lemma 4.10 Let Tl w be the plan generated by GeneratePlan (P, W, M), let v be the 
first state variable in topological order present in W , and let LT^ = (IP, n^, IP) be the final 
plan generated by MACRO-SSfPj. Ifm\ G M it follows that (pre(W)®post(Ii a ))Vpre(m v l ). 

Proof We determine the content of the operator sequence IP that precedes II 1 ^ in the final 
plan by inspection. Note that the call GeneratePlan(P, W, M) has to be nested within 
a sequence of recursive calls to GeneratePlan starting with GeneratePlan(P, V, M). 
Let Z be the set of state variables such that each u <G Z was the first state variable in 
topological order for some call to GeneratePlan prior to GeneratePlan(P, W, M). 
Each u € Z has to correspond to a call to GeneratePlan with some set of state variables 
U such that W C U. If u is not splitting, u does not contribute to IP since the only 
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possible addition of a macro to the plan on line 16 places the macro m" at the end of the 
plan generated recursively. 

Assume that u G Z is a splitting state variable. We have three cases: W C Vq", W C V", 
and VF C V™ = F — Vq 1 — V"". If^CVJ 1 ,!! does not contribute to IP since it never places 
macros before II". If W C Vq", the plan II" is part of IP since it precedes IIq on lines 11, 

12, and 13. If W C 1/", the plans II" and IIq are part of IP since they both precede II" in 
all cases. If m" G M, the macro m" is also part of IT 1 since it precedes II" on lines 12 and 

13. No other macros are part of 11°. 

Since the macros in M are unary, the plan generated by GeneratePlan(P, U, M) 
only changes the values of state variables in U. For a splitting state variable u, there are 
no edges from V™ — {u} to Vq", from V™ — {u} to Vf, or from Vq" to V™. It follows that the 
plan II" does not change the value of any state variable that appears in the pre-condition 
of a macro in IIq. The same holds for II" with respect to II" and for IIq with respect to II". 
Thus, the only macro in IP that changes the value of a splitting state variable u G And" is 
m" in case W C V". 

Recall that pre" is defined on And" and assigns 1 to u if and only if u is splitting and 
v G V"". For all other ancestors of v, the value holds in the initial state and is not 
altered by II a . If u is splitting and v G V", it follows from the definition of 3S-macros that 
pre{m\){u) = 1 or pre(m\)(u) =_L. If pre(m\)(u) = 1, it is correct to append m" before 
m'l to satisfy pre{m\){u). If m" ^ M it follows that u £ V^ re ( m «), since pre(m\)(u) = 1 
would have caused GenerateMacro(P, v, 1, M) to set satisfy to false on line 8. Thus, 
the pre-condition pre(m\) of agrees with pre(H a ) (B post(H a ) on the value of each state 
variable, which means that the two partial states match. 

Lemma 4.11 GeneratePlan^P, V, M) generates a well-defined plan. 

Proof Note that for each state variable v G V, GeneratePlan(P, W, M) is called 
precisely once such that v is the first state variable in topological order. From Lemma 4.10 
it follows that (pre(H a ) © post(Ji a ))Vpre{m\), where IP is the plan that precedes H w in 
the final plan. Since v is the first state variable in topological order in W, the plans IIq, 
H\, 11", and II, recursively generated by GeneratePlan, do not change the value of any 
state variable in pre(rrii). It follows that m\ is applicable following (n a ,n^,riQ) or (IP, II}. 
Since m\ only changes the value of v, nriQ is applicable following (IP, II", IIq, m\, U\). 

Theorem 4.12 MACRO-3S (P) generates a valid plan for solving a planning problem in 3S 
if and only if one exists. 

Proof GeneratePlan (P, V, M) returns fail if and only if there exists a state variable 
v G V such that goal(v) = 1 and m\ ^ M. From Lemma 4.9 it follows that there does 
not exist a valid plan for solving P that sets v to 1. Consequently, there does not exist a 
plan for solving P. Otherwise, GeneratePlan (P, V, M) returns a well-defined plan due 
to Lemma 4.11. Since the plan sets to 1 each state variable v such that goal(v) = 1 and 
resets to each state variable v such that goal{v) = 0, the plan is a valid plan for solving 
the planning problem. 
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Figure 6: Causal graph of the planning problem P5. 

4.3 Examples 

We illustrate the algorithm on an example introduced by Jonsson and Backstrom (1998) to 
show that there are instances of 3S with exponentially sized minimal solutions. Let P n = 
(V,init, goal, A) be a planning problem defined by a natural number n, V = {v±, . . . ,v n }, 
and a goal state defined by V goa i = V, goal{vi) = for each Vi £ {v\, . . . , v n -\}, and 
goal(v n ) = 1. For each state variable Vi G V, there are two operators in A: 

a T = (( v i = 0, . . . ,Vi- 2 = 0,Vi-i = l,Vi = 0); (vi = 1)}, 

a o l = (( v i = 0, . . . ,Vi- 2 = 0,Vi-i = l,Vi = 1); (vi = 0)}. 

In other words, each state variable is symmetrically reversible. The causal graph of the plan- 
ning problem P5 is shown in Figure 6. Note that for each state variable V{ <G {v\, . . . , v n -2}, 
pre{a"^ +1 ){vi) = 1 and pre{a v ^ +2 ){vi) = 0, so Vi+i G Q v { and Vi + 2 € Qq. Since there is 
an edge in the causal graph between Vi+i and no state variable in {v±, . . . , 1^-2} is 

splitting. On the other hand, v n -\ and v n are splitting since V^" -1 = and Vq" = V^ n = 0. 
Backstrom and Nebel (1995) showed that the length of the shortest plan solving P n is 2 n — 1, 
i.e., exponential in the number of state variables. 

For each state variable G {vi, . . . , v n -±}, our algorithm generates two macros m v ^ and 
Wq'. There is a single operator, a\\ that changes the value of v,- L from to 1. pre(a\ i ) 
only assigns 1 to so U = {vi-±}. Since V{-\ is not splitting, is defined as m v ± = 
(m^ l_1 , a\% raQ ,_1 ). Similarly, mjj' is defined as = (m^ 1 ,Oq ,mg ,_1 ) . For state variable 
v n , U = {v n -i}, which is splitting, so m^ n is defined as m^ n = {a^ 1 ). 

To generate the final plan, the algorithm goes through the state variables in topolog- 
ical order. For state variables v\ through v n -2, the algorithm does nothing, since these 
state variables are not splitting and their goal state is not 1. For state variable v n -i, 
the algorithm recursively generates the plan for v n , which is (m^ n ) since goal{v n ) = 1. 
Since goal(v n -\) = 0, the algorithm inserts m^ n_1 before m^ n to satisfy its pre-condition 
v n -i = 1 and f^o™ 1 & ft er m i™ to achieve the goal state goal(v n -i) = 0. Thus, the final plan 
is (ml"' 1 , ml", mj™" 1 ). If we expand the plan, we end up with a sequence of 2 n — 1 opera- 
tors. However, no individual macro has operator sequence length greater than 3. Together, 
the macros recursively specify a complete solution to the planning problem. 

We also demonstrate that there are planning problems in 3S with polynomial length 
solutions for which our algorithm may generate exponential length solutions. To do this, 
we modify the planning problem P n by letting goal{vi) = 1 for each Vi £ V. In addition, 
for each state variable Vi € V, we add two operators to A: 

b T = (( v i = !,••• ,Vi-i = l,Vi = 0); (vi = 1)), 
b o = (( v i = !,••• ,Vi-i = l,Vi = 1); (vi = 0)). 
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We also add an operator c\ n = ((v n -i = 0,v n = 0); (v n = 1)) to A. As a conse- 
quence, state variables in {v±, . . . ,v n -2} are still symmetrically reversible but not splitting. 
v n -\ is also symmetrically reversible but no longer splitting, since pre{a v l n )(v n ^i) = 1 and 
pre{cl n ){v n -i) = implies that v n G V^"' 1 HV^"' 1 . v n is still splitting since V Vn = V? n = 0. 
Assume that GenerateMacro(P, Vi, x, M) always selects b v J first. As a consequence, for 
each state variable vi G V and each x G {0, 1}, GenerateMacro(P, Vi, x, M) generates 
the macro m v J = (m"^ 1 , . . . , m^ 1 , b v J , m^ 1 , . . . , rn"^ 1 ) . 

Let Li be the length of the plan represented by m v J, x G {0, 1}. From the definition of 
above we have that Lj = 2{L\ + . . . + + 1. We show by induction that Lj = 3 l_1 . 
The length of any macro for v\ is L\ = 1 = 3°. For i > 1, 

qi-l _ -y 3 i_1 — 1 

Li = 2(3° + . . . + 3 1 " 2 ) + 1 = 2 + 1 = 2 + 1 = 3* -1 - 1 + 1 = 3 l_1 . 

3 1 2 

To generate the final plan the algorithm has to change the value of each state variable 
from to 1, so the total length of the plan is L = L\ + . . . + L n = 3° + . . . + 3™ _1 = 
(3 n — l)/2. However, there exists a plan of length n that solves the planning problem, 
namely (6^, . . . , b\ n ). 



4.4 Complexity 



In this section we prove that the complexity of our algorithm is polynomial. To do this 
we analyze each step of the algorithm separately. A summary of the complexity result for 
each step of the algorithm is given below. Note that the number of edges \E\ in the causal 
graph is 0(|A||F|), since each operator may introduce 0(| V|) edges. The complexity result 
0(\V\ + \E\) = 0(|^4||y|) for topological sort follows from Cormen, Leiserson, Rivest, and 
Stein (1990). 



Constructing the causal graph G = (V, E) 
Calculating V* and Vq for each v G V 
Performing a topological sort of G 
GenerateMacro(P, v, x, M) 
GeneratePlan(P, V, M) 
Macro-3S(P) 



0(\A\\V\) 
0(L4||Vf) 
0(\A\\V\) 
0(\A\\V\) 

0(|Vf) 
0(\A\\Vf) 



Lemma 4.13 
Lemma 4.14 

Lemma 4.15 
Lemma 4.16 
Theorem 4.17 



Lemma 4.13 The complexity of constructing the causal graph G = (V, E) is 0(\A\\V\). 

Proof The causal graph consists of \V\ nodes. For each operator a G A and each state 
variable u G lp re ( a ) , we should add an edge from u to the unique state variable v G V post ^ . 



In the worst case, \V, 



pre(a) I 



0(|V|), in which case the complexity is 0(|.A||V| 



Lemma 4.14 The complexity of calculating the sets Vq and V± for each state variable 
v eV is 0(\A\\V\ 2 ). 



Proof For each state variable v G V, we have to establish the sets Qq and Q\, which requires 
going through each operator a G A in the worst case. Note that we are only interested in 
the pre-condition on v and the unique state variable in V post ^, which means that we do not 
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need to go through each state variable in V pre ( a y Next, we have to construct the graph Gq. 
We can do this by copying the causal graph G, which takes time 0(|A||V|), and removing 
the edges between v and Qq — Q\, which takes time 0(|V|). 

Finally, to construct the set we should find each state variable that is weakly con- 
nected to some state variable u £ Qq in the graph Gq. For each state variable u G Qq, 
performing an undirected search starting at u takes time 0(|A||V|). Once we have per- 
formed search starting at u, we only need to search from state variables in Qq that were 
not reached during the search. This way, the total complexity of the search does not exceed 
0(|-A||V|). The case for constructing V£ is identical. Since we have to perform the same 
procedure for each state variable v 6 V, the total complexity of this step is 0(|^4||~l/| 2 ). 

Lemma 4.15 The complexity of GenerateMacrO(P, v, x, M) is 0(\A\\V\). 

Proof For each operator a € A, GenerateMacro(P, v, x, M) needs to check whether 
post(a)(v) = x. In the worst case, \U\ = 0(\V\), in which case the complexity of the 
algorithm is 0(|A||V|). 

Lemma 4.16 The complexity of GeneratePlan(P ; V, M) is 0(\V\ 2 ). 

Proof Note that for each state variable v G V, GeneratePlan(P, V, M) is called re- 
cursively exactly once such that v is the first variable in topological order. In other words, 
GeneratePlan(P, V, M) is called exactly |V| times. GeneratePlan(P, V, M) contains 
only constant operations except the intersection and difference between sets on lines 6-8. 
Since intersection and set difference can be done in time 0(|V|), the total complexity of 
GeneratePlan(P, V, M) is 0{\V\ 2 ). 

Theorem 4.17 The complexity of M ACROSS (P ) is 0(|^||y| 2 ). 

Proof Prior to executing Macro-3S(P), it is necessary to construct the causal graph G, 
find the sets V V and V™ for each state variable v G V, and perform a topological sort 
of G. We have shown that these steps take time 0(|.A||V| 2 ). For each state variable 
v £ V, Macro-3S(P) calls GenerateMacro(P, v, x, M) twice. From Lemma 4.15 it 
follows that this step takes time 0(2|Vp||y|) = 0{\A\\V\ 2 ). Finally, Macro-3S(P) calls 
GeneratePlan(P, V, M), which takes time 0(|F| 2 ) due to Lemma 4.16. It follows that 
the complexity of Macro-3S(P) is 0(|,4| |V| 2 ). 

We conjecture that it is possible to improve the above complexity result for Macro- 
3S(P) to 0(|A||y|). However, the proof seems somewhat complex, and our main objective 
is not to devise an algorithm that is as efficient as possible. Rather, we are interested in 
establishing that our algorithm is polynomial, which follows from Theorem 4.17. 

4.5 Plan Length 

In this section we study the length of the plans generated by the given algorithm. To begin 
with, we derive a general bound on the length of such plans. Then, we show how to compute 
the actual length of some particular plan without expanding its macros. We also present 
an algorithm that uses this computation to efficiently obtain the i-th action of the plan 
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from its macro form. We start by introducing the concept of depth of state variables in the 
causal graph. 

Definition 4.18 The depth d{y) of a state variable v is the longest path from v to any 
other state variable in the causal graph. 

Since the causal graph is acyclic for planning problems in 3S, the depth of each state variable 
is unique and can be computed in polynomial time. Also, it follows that at least one state 
variable has depth 0, i.e., no outgoing edges. 

Definition 4.19 The depth d of a planning problem P in 3S equals the largest depth of 
any state variable v of P, i.e., d = max^v d{v). 

We characterize a planning problem based on the depth of each of its state variables. Let 
n = \V\ be the number of state variables, and let q denote the number of state variables 
with depth i. If the planning problem has depth d, it follows that cq + . . . + c d = n. As an 
example, consider the planning problem whose causal graph appears in Figure 2. For this 
planning problem, n = 8, d = 5, Co = 2, c\ = 2, c 2 = 1, C3 = 1, C4 = 1, and C5 = 1. 

Lemma 4.20 Consider the values Li for i £ {0,...,d} defined by L d = 1, and Li = 
2(ci + \Li + i + c.j + 2^i+2 + . . . + c d L d ) + 1 when i < d. The values L, L are an upper bound on 
the length of macros generated by our algorithm for a state variable v with depth i. 

Proof We prove it by a decreasing induction on the value of i. Assume v has depth i = d. 
It follows from Definition 4.18 that v has no incoming edges. Thus, an operator changing 
the value of v has no pre-condition on any state variable other than v, so L d = 1 is an upper 
bound, as stated. 

Now, assume v has depth i < d, and that all L i+ ^ for k > are upper bounds on the 
length of the corresponding macros. Let o £ A be an operator that changes the value of v. 
From the definition of depth it follows that a cannot have a pre-condition on a state variable 
u with depth j < i; otherwise there would be an edge from u to v in the causal graph, causing 
the depth of u to be greater than i. Thus, in the worst case, a macro for v has to change 
the values of all state variables with depths larger than i, change the value of v, and reset 
the values of state variables at lower levels. It follows that Li = 2(q + iLj + i + . . . + cjLd) + 1 
is an upper bound. 

Theorem 4.21 The upper bounds Li of Lemma 4- 20 satisfy Li = II^ =i+1 (l + 2cj). 
Proof Note that 

Li = 2(c i+ iL i+1 + c i+2 L i+ 2 + . . . + c d L d ) + 1 = 
= 2c i+1 L i+1 + 2(ci +2 L i+ 2 + . . . + c d L d ) + 1 = 
= 2ci + \Li + \ + Lj + i = (2cj+i + l)Lj + i. 

The result easily follows by induction. 
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Now we can obtain an upper bound L on the total length of the plan. In the worst 
case, the goal state assigns a different value to each state variable than the initial state, 
i.e., goal{y) ^ init{v) for each v € V. To achieve the goal state the algorithm applies one 
macro per state variable. Hence 

T T T , j T , Lq-1 (1 + 2 C q)Lo-1 l/r 1 

L = c L + ciLi + . . . + c d L d = c L H — = = - ||( 1 + 2c,-) - -. 

3=0 

The previous bound depends on the distribution of the variables on depths according 
to the causal graph. To obtain a general bound that does not depend on the depths of the 
variables we first find which distribution maximizes the upper bound L. 

Lemma 4.22 The upper bound L = | Yl^ =0 (^-+2cj) — \ 071 V^ an riing problems on n variables 
and depth d is maximized when all Ci are equal, that is, Ci = n/(d + 1). 

Proof Note that q > for all i, and that Co + ■ ■ ■ + c d = n. The result follows from a direct 
application of the well known AM-GM (arithmetic mean-geometric mean) inequality, which 
states that the arithmetic mean of positive values X{ is greater or equal than its geometric 
mean, with equality only when all Xi are the same. This implies that the product of positive 
factors Xi = (1 + 2cj) with fixed sum A = Ylj=o x j = 2n + d is maximized when all are 
equal, that is, Cj = n/{d+ 1). 

Theorem 4.23 The length of a plan generated by the algorithm for a planning problem in 
3S with n state variables and depth d is at most ((1 + 2n/(d+ l)) d+1 — l)/2. 

Proof This is a direct consequence of Lemma 4.22. Since cq, . . . , c d are discrete, it may not 
be possible to set c = . . . = c d = n/(d + l). Nevertheless, ((1 + 2n/(d + l)) d+1 - l)/2 is an 
upper bound on L in this case. 

Observe that the bound established in Theorem 4.23 is an increasing function of d. This 
implies that for a given d, the bound also applies to planning problems in 3S with depth 
smaller than d. As a consequence, if the depth of a planning problem in 3S is bounded 
from above by d, our algorithm generates a solution plan for the planning problem with 
polynomial length 0(n d+1 ). Since the complexity of executing a plan is proportional to 
the plan length, we can use the depth d to define tractable complexity classes of planning 
problems in 3S with respect to plan execution. 

Theorem 4.24 The length of a plan generated by the algorithm for a planning problem in 
3S with n state variables is at most (3 ra — l)/2. 

Proof In the worst case, the depth d of a planning problem is n— 1. It follows from Theorem 
4.23 that the length of a plan is at most ((1 + 2n/n) n - l)/2 = (3 n - l)/2. 

Note that the bound established in Theorem 4.24 is tight; in the second example in Section 
4.3, we showed that our algorithm generates a plan whose length is (3 n — l)/2. 
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1 function Operator^, i) 

2 o 4— first operator in S 

3 while length(o) < i do 

4 i «— i — length(o) 

5 o •<— next operator in S 1 

6 if primitive(o) then 

7 return o 

8 else 

9 return Operator(o, i) 

Figure 7: An algorithm for determining the i-th operator in a sequence 

Lemma 4.25 The complexity of computing the total length of any plan generated by our 
algorithm is 0(|V| 2 ). 

Proof The algorithm generates at most 2\V\ = 0(\V |) macros, 2 for each state variable. The 
operator sequence of each macro consists of one operator and at most 2(\V\ — 1) = 0(|V|) 
other macros. We can use dynamic programming to avoid computing the length of a macro 
more than once. In the worst case, we have to compute the length of 0(|V|) macros, each 
of which is a sum of 0(|V|) terms, resulting in a total complexity of 0(|V| 2 ). 

Lemma 4.26 Given a solution plan of length I and an integer 1 < i < I, the complexity of 
determining the i-th operator of the plan is 0(\V\ 2 ). 

Proof We prove the lemma by providing an algorithm for determining the i-th operator, 
which appears in Figure 7. Since operator sequences S consist of operators and macros, 
the variable o represents either an operator in A or a macro generated by MACRO-3S. The 
function primitive(o) returns true if o is an operator and false if o is a macro. The function 
length(o) returns the length of o if o is a macro, and 1 otherwise. We assume that the length 
of macros have been pre-computed, which we know from Lemma 4.25 takes time 0(|y| 2 ). 

The algorithm simply finds the operator or macro at the i-th. position of the sequence, 
taking into account the length of macros in the sequence. If the i-th position is part of 
a macro, the algorithm recursively finds the operator at the appropriate position in the 
operator sequence represented by the macro. In the worst case, the algorithm has to go 
through 0(|V|) operators in the sequence S and call Operator recursively 0(|V|) times, 
resulting in a total complexity of OdV^ 2 ). 

4.6 Discussion 

The general view of plan generation is that an output should consist in a valid sequence of 
grounded operators that solves a planning problem. In contrast, our algorithm generates a 
solution plan in the form of a system of macros. One might argue that to truly solve the 
plan generation problem, our algorithm should expand the system of macros to arrive at the 
sequence of underlying operators. In this case, the algorithm would no longer be polynomial, 
since the solution plan of a planning problem in 3S may have exponential length. In fact, if 
the only objective is to execute the solution plan once, our algorithm offers only marginal 
benefit over the incremental algorithm proposed by Jonsson and Backstrom (1998). 
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On the other hand, there are several reasons to view the system of macros generated by 
our algorithm as a complete solution to a planning problem in 3S. The macros collectively 
specify all the steps necessary to reach the goal. The solution plan can be generated and 
verified in polynomial time, and the plan can be stored and reused using polynomial memory 
It is even possible to compute the length of the resulting plan and determine the i-th 
operator of the plan in polynomial time as shown in Lemmas 4.25 and 4.26. Thus, for all 
practical purposes the system of macros represents a complete solution. Even if the only 
objective is to execute the solution plan once, our algorithm should be faster than that of 
Jonsson and Backstrom (1998). All that is necessary to execute a plan generated by our 
algorithm is to maintain a stack of currently executing macros and select the next operator 
to execute, whereas the algorithm of Jonsson and Backstrom has to perform several steps 
for each operator output. 

Jonsson and Backstrom (1998) proved that the bounded plan existence problem for 3S 
is NP-hard. The bounded plan existence problem is the problem of determining whether or 
not there exists a valid solution plan of length at most k. As a consequence, the optimal 
plan generation problem for 3S is NP-hard as well; otherwise, it would be possible to 
solve the bounded plan existence problem by generating an optimal plan and comparing 
the length of the resulting plan to k. In our examples we have seen that our algorithm 
does not generate an optimal plan in general. In fact, our algorithm is just as bad as the 
incremental algorithm of Jonsson and Backstrom, in the sense that both algorithms may 
generate exponential length plans even though there exists a solution of polynomial length. 

Since our algorithm makes it possible to compute the total length of a valid solution 
in polynomial time, it can be used to generate heuristics for other planners. Specifically, 
Katz and Domshlak (2007) proposed projecting planning problems onto provably tractable 
fragments and use the solution to these fragments as heuristics for the original problem. We 
have shown that 3S is such a tractable fragment. Unfortunately, because optimal planning 
for 3S is NP-hard, there is no hope of generating an admissible heuristic. However, the 
heuristic may still be informative in guiding the search towards a solution of the original 
problem. In addition, for planning problems with exponential length optimal solutions, a 
standard planner has no hope of generating a heuristic in polynomial time, making our 
macro-based approach (and that of Jonsson, 2007) the only (current) viable option. 

5. The Class C n 

Domshlak and Dinitz (2001) defined the class C n of planning problems with multi- valued 
state variables and chain causal graphs. Since chain causal graphs are acyclic, it follows that 
operators are unary. Moreover, let Vi be the i-th state variable in the chain. If % > 1, for 
each operator a such that V post ^ C {v{\ it holds that V pre ^ = {vi-\,Vi}. In other words, 
each operator that changes the value of a state variable Vi may only have pre-conditions on 
Vi-i and Vi. 

The authors showed that there are instances of C n with exponentially sized minimal 
solutions, and therefore argued that the class is intractable. In light of the previous section, 
this argument on the length of the solutions does not discard the possibility that instances 
of the class can be solved in polynomial time using macros. We show that this is not the 
case, unless P = NP. 
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Figure 9: Domain transition graph for V{. 



We define the decision problem Plan-Existence-C„ as follows. A valid input of Plan- 
Existence-C„ is a planning instance P of C n . The input P belongs to Plan-Existence- 
C n if and only if P is solvable. We show in this section that the problem Plan-Existence- 
C n is NP-hard. This implies that, unless P = NP, solving instances of C„ is a truly 
intractable problem, namely, no polynomial-time algorithm can distinguish between solvable 
and unsolvable instances of C n . In particular, no polynomial-time algorithm can solve C n 
instances by using macros or any other kind of output format. 1 

We prove that Plan-Existence-C^ is NP-hard by a reduction from Cnf-Sat, that is, 
the problem of determining whether a CNF formula F is satisfiable. Let C±, . . . ,C n be the 
clauses of the CNF formula F, and let v±, . . . ,Vk be the variables that appear in F. We 
briefly describe the intuition behind the reduction. The planning problem we create from 
the formula F has a state variable for each variable appearing in F, and plans are forced 
to commit a value (either or 1) to these state variables before actually using them. Then, 
to satisfy the goal of the problem, these variables are used to pass messages. However, the 
operators for doing this are defined in such a way that a plan can only succeed when the 
state variable values it has committed to are a satisfying assignment of F. 

We proceed to describe the reduction. First ,we define a planning problem P(F) = 
(V, init, goal, A) as follows. The set of state variables is V = {v\, . . . , Vk,w}, where D(vi) = 
{S, 0, 1, Ci, C{, . . . , C n , C' n } for each Vi and D(w) = {S, 1, . . . , n}. The initial state defines 
init(v) = S for each v G V and the goal state defines goal(w) = n. Figure 8 shows the 
causal graph of P(F). 

The domain transition graph for each state variable Vi is shown in Figure 9. Each node 
represents a value in D(vi), and an edge from x to y means that there exists an operator 
a such that pre(a)(vi) = x and post(a)(vi) = y. Edge labels represent the pre-condition of 
such operators on state variable and multiple labels indicate that several operators 

are associated with an edge. We enumerate the operators acting on Vi using the notation 
a = (pre(a);post(a)) (when i = 1 any mention of Vi-\ is understood to be void): 



1. A valid output format is one that enables efficient distinction between an output representing a valid 
plan and an output representing the fact that no solution was found. 



341 



GlMENEZ & JONSSON 



@77r(^ -%77r© 

Figure 10: Domain transition graph for w. 

(1) Two operators {vi-i = S,Vi = S;vi = 0) and (i>j_i = S,Vi = S;vi = 1) that allow 
to move from S 1 to either or 1. 

(2) Only when i > 1. For each clause Cj and each X £ {Cj,Cj}, two operators 

= X, Vi = 0;v,i = Cj) and = X, Uj = = Cj). These operators al- 

low vi to move to Cj or if has done so. 

(3) For each clause Cj and each X € {0, 1}, an operator (vi-i = X, V{ = 0; vi = Cj) if Vi 
occurs in clause Cj, and an operator (t>j_i = X, = 1; vi = Cj) if occurs in clause 
Cj. These operators allow Vi to move to Cj or Cj even if v^-i has not done so. 

(4) For each clause Cj and each X = {0, 1}, two operators (fj-i = X, «j = C,;t>j = 0) 
and (vi-i = X,Vi = C'-\Vi = 1). These operators allow v% to move back to or 1. 

The domain transition graph for state variable w is shown in Figure 10. For every clause 
Cj the only two operators acting on w are (v^ = X, w = j — 1; w = j), where X £ {Cj, Cj} 
(if j = 1 , the pre-condition w = j — 1 is replaced by w = S) . 

Proposition 5.1 A CNF formula F is satisfiable if and only if the planning instance P(F) 
is solvable. 

Proof The proof follows from a relatively straightforward interpretation of the variables 
and values of the planning instance P(F). For every state variable Vi, we must use an 
operator of (1) to commit to either or 1. Note that, once this choice is made, variable Vi 
cannot be set to the other value. The reason we need two values Cj and Cj for each clause 
is to enforce this commitment (Cj corresponds to Vi = 0, while Cj corresponds to = 1). 
To reach the goal the state variable w has to advance step by step along the values 1, . . . , n. 
Clearly, for every clause Cj there must exist some variable V{ that is first set to values Cj 
or Cj using an operator of (3). Then, this "message" can be propagated along variables 
Vi + \, ■ . . ,Vk using operators of (2). Note that the existence of an operator of (3) acting on 
Vi implies that the initial choice of or 1 for state variable Vi, when applied to the formula 
variable V{, makes the clause Cj true. Hence, if II is a plan solving P(F), we can use the 
initial choices of IT on state variables Vi to define a (partial) assignment a that satisfies all 
clauses of F. 

Conversely, if a is some assignment that satisfies F, we show how to obtain a plan II 
that solves P(F). First, we set every state variable Vi to value cr(vi). For every one of the 
clauses Cj, we choose a variable Vi among those that make Cj true using assignment a. 
Then, in increasing order of j, we set the state variable v\ corresponding to clause Cj to a 
value Cj or Cj (depending on o~(vi)), and we pass this message along v.i + \, . . . , Vk up to w. 

Theorem 5.2 Plan-Existence-C„ is NP-hard. 
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Figure 11: Causal graph of Pp when F = C\ A C2 A C3 on three variables x,y,z. 

Proof Producing a planning instance P(F) from a CNF formula F can be easily done in 
polynomial time, so we have a polynomial-time reduction Cnf-Sat < p Plan-Existence- 



6. Polytree Causal Graphs 

In this section, we study the class of planning problems with binary state variables and 
polytree causal graphs. Brafman and Domshlak (2003) presented an algorithm that finds 
plans for problems of this class in time 0{n 2K ), where n is the number of variables and 
k is the maximum indegree of the polytree causal graph. Brafman and Domshlak (2006) 
also showed how to solve in time roughly (9(n w<5 ) planning domains with local depth 5 and 
causal graphs of tree- width co. It is interesting to observe that both algorithms fail to solve 
polytree planning domains in polynomial time for different reasons: the first one fails when 
the tree is too broad (unbounded indegree), the second one fails when the tree is too deep 
(unbounded local depth, since the tree- width oj of a polytree is 1). 

In this section we prove that the problem of plan existence for polytree causal graphs 
with binary variables is NP-hard. Our proof is a reduction from 3Sat to this class of 
planning problems. As an example of the reduction, Figure 11 shows the causal graph of 
the planning problem Pp that corresponds to a formula F with three variables and three 
clauses (the precise definition of Pp is given in Proposition 6.2). Finally, at the end of this 
section we remark that the same reduction solves a problem expressed in terms of CP-nets 
(Boutilier et al., 2004), namely, that dominance testing for polytree CP-nets with binary 
variables and partially specified CPTs is NP-complete. 

Let us describe briefly the idea behind the reduction. The planning problem Pp has two 
different parts. The first part (state variables v x , v x , ■ ■ ■ , vq 1 , i/ c , . . . , and v\) depends on 
the formula F and has the property that a plan may change the value of v\ from to 1 as 
many times as the number of clauses of F that a truth assignment can satisfy. However, this 
condition on v\ cannot be stated as a planning problem goal. We overcome this difficulty 
by introducing a second part (state variables v\, V2, ■ ■ ■ , vt) that translates it to a regular 
planning problem goal. 

We first describe the second part. Let P be the planning problem (V,init, goal, A) 
where V is the set of state variables \y\, . . . , t>2fc-i} an d A is the set of 4k — 2 operators 
{ai, . . . , «2fc-i) • • • , /$2fc-i}- For i = l, the operators are defined as a.\ = (vi = 1; v\ = 0) 
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and j3\ = (v\ = Q;v\ = 1). For i > 1, the operators are «j = (vi-\ = 0,Vi = l;vi = 0) and 
A = = = 0; v i = 1). The initial state is init(vi) = for all i, and the goal state 
is goal{vi) = if i is even and goal(vi) = 1 if odd. 

Lemma 6.1 Any valid plan for planning problem P changes state variable v\ from to 1 
at least k times. There is a valid plan that achieves this minimum. 

Proof Let Ai and Bi be, respectively, the sequences of operators (a\, ... ,(Xi) and (Pi, . . . , /%). 
It is easy to verify that the plan (i?2fc-i, A2k-2: ^2k-3j • • • > B%, A2, B{) solves the planning 
problem P. Indeed, after applying the operators of Ai (respectively, the operators of Bi), 
variables v\,...,Vi become (respectively, 1). In particular, variable Vi attains its goal 
state (0 if i is even, 1 if i is odd). Subsequent operators in the plan do not modify Vi, so 
the variable remains in its goal state until the end. The operator j3i appears k times in the 
plan (one for each sequence of type Bi), thus the value of v\ changes k times from to 1. 

We proceed to show that k is the minimum. Consider some plan LT that solves the 
planning problem P, and let \ be the number of operators Qjj and appearing in LT (in 
other words, A, is the number of times that the value of Vi changes, either from to 1 or 
from 1 to 0). Note that the number of times operator appears is equal to or precisely one 
more than the number of occurrences of ctj. We will show that Aj_i > Aj. Since \2k-1 < L 
this implies that X\ > 2k — 1, so that plan LT has, at least, k occurrences of /3i, completing 
the proof. 

We show that Aj_i > A«. Let Si be the subsequence of operators a« and in plan LT. 
Clearly, Si starts with (since the initial state is Vi = 0), and the same operator cannot 
appear twice consecutively in Si, so Si = /3i,cti, fii,ai, etc. Also note that, for i > 1, has 
Vi-\ = 1 as a pre-condition, and aj has vi-\ = 0, hence there must be at least one operator 
ccj-i in plan LT betweeen any two operators j3i and Oj. For the same reason we must have 
at least one operator A-i between any two operators ai and /3j, and one operator f$i-\ 
before the first operator This shows that \-\ > A«. On the other hand, variables Vi and 
Vi-i have different values in the goal state, so subsequences Si and Si-i must have different 
lengths, that is, Aj_i 7^ Aj. Together, this implies Aj_i > Aj, as desired. 

Proposition 6.2 3Sat reduces to plan existence for planning problems with binary vari- 
ables and polytree causal graphs. 

Proof Let F be a CNF formula with k clauses and n variables. We produce a planning 
problem Pp with 2n + 4k — 1 state variables and 2n + 147s — 3 operators. The planning 
problem has two state variables v x and Vx for every variable x in F, two state variables vq 
and v'q for every clause C in F, and 2k — 1 additional variables v\,..., V2k-i- All variables 
are in the initial state. The (partial) goal state is defined by V goa i = {v±, . . . , V2k-i}, 
goal(vi) = when i is even, and goal(vi) = 1 when i is odd, like in problem P of Lemma 
6.1. The operators are: 

(1) Operators (v x = 0; v x = 1) and (v x = 0; Vx = 1} for every variable x of F. 

(2) Operators {v' c = 0; v ' c = 1), {v' c = 0,vc = 0;vc = 1) and (v' c = l,vc = l;«c = 0) 
for every clause C of F. 
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(3) Seven operators for every clause C, one for each partial assignment that satisfies C. 
Without loss of generality, let x, y, and z be the three variables that appear in clause C. 
Then for each operator a among these seven, V pre ^ = {v x ,Vx,v y ,Vy,v z ,Vz,vc,vi}, 
V P ost(a) = {vi}, pre(a)(v c ) = 1, pre{a){v 1 ) = 0, and post{a){v{) = 1. The pre- 
condition on state variables v x ,Ux,v y ,Vy,v z ,Vz depends on the corresponding satis- 
fying partial assignment. For example, the operator corresponding to the partial 
assignment {x = 0, y = 0, z = 1} of the clause C = x V y V z has the pre-condition 

(V x = 0, Vx = 1, Vy = 0, Vy = 1, v z = 1, Vz = 0). 

(4) An operator ((yC,v c = 0),vi =l;vi= 0). 

(5) Operators ctj = = 0, Vi = l;vi = 0} and = (vj-i = = 0; t> j = 1) for 
2 < i < 2k — 1 (the same operators as in problem P except for a,\ and f}\). 

We note some simple facts about problem Pp. For any variable x, state variables v x and 
^ in Pp start at 0, and by applying the operators in (1) they can change into 1 but not 
back to 0. In particular, a plan II cannot reach both of the partial states (v x = l,Vx = 0) 
and (v x = 0, v% = 1) during the course of its execution. 

Similarly, if C is a clause of F, state variable vc can change from to 1 and, by first 
changing v' c into 1, vc can change back to 0. No further changes are possible, since no 
operator brings back v' c to 0. 

Now we interpret operators in (3) and (4), which are the only operators that affect v\. 
To change v\ from to 1 we need to apply one of the operators in (3), thus we require 
vc = 1 for a clause C. But the only way to bring back v\ to is applying the operator in 
(4) which has as pre-condition that vc = 0. We deduce that every time that v\ changes its 
value from to 1 and then back to in plan II, at least one of the k state variables vc is 
used up, in the sense that vc has been brought from to 1 and then back to 0, and cannot 
be used again for the same purpose. 

We show that F is in 3Sat if and only if there is a valid plan for problem Pp. Assume 
F is in 3Sat, and let a be a truth assignment that satisfies F. Consider the following plan 
IT. First, we set v x = cr(x) and = 1 — c(x) for all variables x using the operators of (1). 
Then, for a clause C in F, we set vc = 1, we apply the operator of (3) that corresponds to 
a restricted to the variables of clause C (at this point, v\ changes from to 1), then we set 
v' c = 1 and vc = 0, and we apply the operator of (4) (at this point, v\ change from 1 to 
0). By repeating this process for every clause C of F we are switching the state variable v\ 
exactly k times from to 1. Now, following the proof of Lemma 6.1, we can easily extend 
this plan II' to a plan II that sets all variables v\ to their goal values. 

We show the converse, namely, that the existence of a valid plan II in Pp implies that F 
is satisfiable. Define an assignment a by setting a(x) = 1 if the partial state {v x = 1, v x = 0} 
appears during the execution of II, and u{x) = otherwise. (Recall that at most one of the 
partial states {v x = 1, v x = 0} and {v x = 0, v x = 1} can appear during the execution of any 
plan). By Lemma 6.1, II must be such that state variable v\ changes from to 1 at least k 
times. This implies that k operators of (3), all of them corresponding to different clauses, 
have been used to move v\ from to 1. But to apply such an operator, the values of state 
variables {v x ,Vx} must satisfy the corresponding clause. Thus the assignment a satisfies all 
the k clauses of F. 
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Theorem 6.3 Plan existence for planning problems with binary variables and polytree 
causal graph is NP-complete. 

Proof Due to Proposition 6.2 we only need to show that the problem is in NP. But 
Brafman and Domshlak (2003) showed that this holds in the more general setting of planning 
problems with causal graphs where each component is directed-path singly connected (that 
is, there is at most one directed path between any pair of nodes). Their proof exploits a 
non-trivial auxiliary result: solvable planning problems on binary variables with a directed- 
path singly connected causal graph have plans of polynomial length (the same is not true 
for non-binary variables, or unrestricted causal graphs). 

6.1 CP-nets 

Boutilier et al. (2004) introduced the notion of a CP-net as a graphical representation of 
user preferences. In brief, a CP-net is a network of dependences on a set of variables: the 
preferences the user has for a variable depend on the values of some of the others, under the 
ceteris paribus (all else being equal) assumption, that is, the user preferences on the variable 
are completely independent of the values of the variables not mentioned. The preferences 
for a variable given its parent variables in the network are stored in conditional preference 
tables, or CPTs. 

Boutilier et al. (2004) showed that the dominance query problem in acyclic CP-nets, 
that is, the problem of deciding if one variable outcome is preferable to another, can be 
expressed in terms of a planning problem. The network of dependences of the CP-net 
becomes the causal graph of the planning problem. 

However, under certain conditions, we can perform the opposite process: transform 
a planning problem into a CP-net and a dominance query problem, such that answering 
the query amounts to solving the planning problem. This is possible under the following 
conditions on planning problems with acyclic causal graph and binary variables: 

1. Two operators that modify the same variable in opposing directions must have non- 
matching prevail conditions (the prevail condition of an operator a is the partial state 
pre(a) \ V - V post(a) ). 

2. We must allow partially specified CPTs in the CP-net description. 

The first condition guarantees that we obtain consistent CPTs from the planning instance 
operators. The second condition ensures that the reduction is polynomial-size preserving, 
since fully specified CPTs are exponential in the maximum node indegree of the CP-net. 

In particular, the planning instance Pp we reduced F to satisfies the first condition. 
(Note that this is not true for the planning problem P of Lemma 6.1, but we drop the 
reversing operators a.\ and j3\ when constructing Pp in Proposition 6.2.) As a consequence, 
we can claim the following: 

Theorem 6.4 Dominance testing for polytree CP-nets with binary variables and partially 
specified CPTs is NP-complete. 
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7. Conclusion 

We have presented three new complexity results for planning problems with simple causal 
graphs. First, we provided a polynomial-time algorithm that uses macros to generate solu- 
tion plans for the class 3S. Although the solutions are generally suboptimal, the algorithm 
can generate representations of exponentially long plans in polynomial time. This has sev- 
eral implications for theoretical work in planning, since it has been generally accepted that 
exponentially sized minimal solutions imply that plan generation is intractable. Our work 
shows that this is not always the case, provided that one is allowed to express the solution 
in a succinct notation such as macros. We also showed that plan existence for the class C n 
is NP-hard, and that plan existence for the class of planning problems with binary variables 
and polytree causal graph is NP-complete. 

Jonsson and Backstrom (1998) investigated whether plan generation is significantly 
harder than plan existence. Using the class 3S, they demonstrated that plan existence 
can be solved in polynomial time, while plan generation is intractable in the sense that 
solution plans may have exponential length. Our work casts new light on this result: even 
though solution plans have exponential length, it is possible to generate a representation 
of the solution in polynomial time. Thus, it appears as if for the class 3S, plan generation 
is not inherently harder than plan existence. We are not aware of any other work that 
determines the relative complexity of plan existence and plan generation, so the question 
of whether plan generation is harder that plan existence remains open. 

A potential criticism of our algorithm is that a solution in the form of macros is not 
standard, and that it is intractable to expand the system of macros to arrive at the possibly 
exponentially long sequence of underlying operators. Although this is true, we have shown 
that the system of macros share several characteristics with a proper solution. It is possible 
to generate and validate the solution in polynomial time, and the solution can be stored 
using polynomial memory. We also showed that it is possible to compute the total length 
of the solution in polynomial time, as well as determine which is the i-th operator in the 
underlying sequence. 

Since they are relatively simple, the class C n and the class of planning problems with 
binary state variables and polytree causal graphs could be seen as promising candidates for 
proving the relative complexity of plan existence and plan generation. However, we have 
shown that plan existence for C n is NP-hard, and that plan existence for planning problems 
with polytree causal graphs is NP-complete. Consequently, these classes cannot be used 
to show that plan generation is harder than plan existence, since plan existence is already 
difficult. Our work also closes the complexity gaps that appear in the literature regarding 
these two classes. 

It is however possible that there exist subsets of planning problems in these classes 
for which plan existence can be solved in polynomial time. In fact, for polytree causal 
graphs in binary variables we know that this is the case, due to the algorithms of Brafman 
and Domshlak (2003, 2006) mentioned in Section 6. Hence the plan generation problem 
is polynomial if we restrict to polytree causal graphs with either bounded indegree k or 
bounded local depth S. Consequently, our reduction from 3Sat exhibits both unbounded 
indegree and unbounded local depth. 
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Similarly, one may ask if the class C n of planning problems has some parameter that, 
when bounded, would yield a tractable subclass. The state variables in our reduction have 
domains whose size depends on the number of clauses of the corresponding CNF formula, 
so the domain size appears as an interesting candidate. Planning problems of C n with 
binary variables are tractable due to the work of Brafman and Domshlak (2003), but the 
ideas they use do not extend to domain sizes other than 2. Hence it would be interesting 
to investigate whether the problem of plan existence for the class C n is easier if the size of 
the state variable domains is bounded by a constant. 

Appendix A. Proof of Theorem 4.8 

Assume that GenerateMacro(P, v, x, M) successfully returns the macro m v x = {Si, a, So). 
Let U = {u € V we ! a \ — {v} | pre(a)(u) = 1} and let W = {w\, . . . , w^} C U be the set 
of state variables in U such that Wi is not splitting, {m^ ,m™ % } £ M, and Wi comes be- 
fore Wj in topological order if and only if % < j. It follows that no u 6 U is static, that 
6*1 = (m™ k , . . . , m™ 1 } and that Sq = (m^ 1 , . . . , rn Q Uk ) . Since each state variable w i 6 W is 
not splitting, it has to be symmetrically reversible. 

Lemma A.l For each Wi € W , pre Wl C pre" . 

Proof Since Wi £ V pre ^ and v £ V post t a -\, there is an edge from Wi to v in the causal graph. 
Thus, any ancestor of Wi is also an ancestor of v, so Anc Wi C Anc v . For a state variable 
u € Anc Wi , pre Wi (u) = 1 if and only if u is splitting and Wi 6 V". The graph Gf = (V, Ef) 
includes the edge from Wi to v, which means that v € V™ if and only if Wi £ V™. It follows 
that pre Wi (u) = 1 if and only if pre v (u) = 1, and as a consequence, pre Wi C pre v . 

Let II = (So, a, Si). For each wi 6 W and y € {0, 1}, let IL^ be the sequence preceding 
the macro in II, that is, iPf = (m™ k , . . . ,mf +1 ) and 11^ = (So,a,m^\ . . . .m^'" 1 ). 
Further, let IP be the sequence appearing before a, that is, IP = (So). 

Lemma A. 2 For each 1 < i < k, the post- conditions of sequences n™% IP, and IIq' are 

• post(lTf ) = = l,...,w k = l), 

• post(IP) = (ioi = 1, . . . , w k = 1), 

• posting) = (wi = 0, . . . = 0,Wi = 1, . . . = l,v = x). 

Proof A direct consequence of post((ai, . . . , a^)) = posi(ai)©- • -@post(ak) and post^m™*) = 
(wi = y), post(a) = (v = x). 

Lemma A. 3 For each 1 < i < k, the pre-conditions of sequences II™ 1 , IP, IIq% and U 
satisfy pre(II^ 1 ) C pre(IP) C pre^*) C pre(II) C pre" ®(v = 1- x). 

Proof Since pre((ai, a k )) = pre(ah)®- ■ •©pre(ai), it follows that pre^™') Cpre(IP) C 
pre(IlQ < ) C pre(II). We prove that pre(II) C pre" ® (y = 1 — x). For a state variable u 
such that pre(n)(it) /_L, let m u be the first operator in (So, a, Si) such that u € V^ re ( m «), 
so that pre (II) (it) = pre(m u )(u). 
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If m u = m™\ then it follows that pre(m u ) C pre Wi © (wi = 0) C pre", where we have 
used that m™* is a 3S-macro, Wi is symmetrically reversible, and that pre Wi C pre 11 due to 
Lemma A.l. In particular, pre(m u )(u) = pre v (u). 

Since we assume that planning problems are in normal form, u = W{ implies that 
u <G V pre ( m wiy It follows that if m u / m™' for all i, then u ^ Wi for all i If m" = m^' 
we have that pre(m u ) C pre" 11 © (u>j = 1), but due to u 7^ Wi, we deduce pre(m u )(u) = 
pre Wl (u) =pre v (u). 

Finally, consider the case m u = a. If u = v then pre(m u )(u) = 1 — x, as desired. If 
u 7^ v is splitting, then either u belongs to Vq and pre(m u )(u) = 0, or v belongs to and 
pre(m u )(u) = 1. That is, pre(m u )(u) = pre v (u). If u 7^ w is symmetrically reversible it 
follows that pre(m u )(u) = 0, since the case pre(m u )(u) = 1 would have forced the algorithm 
to either fail or include u in VF\ If u 7^ w is static, pre(m u )(u) = 0, else the algorithm would 
have failed. 

Lemma A. 4 Let p,p', q and r be partial states. If p Cp' and (p' © q) Vr, t/ien (p © (?) Vr. 
Proof A direct consequence of p © q C p' © (7. 

Lemma A. 5 TTie macro m" x generated by the algorithm is well-defined. 

Proof Since II only includes macros for the ancestors of v in the causal graph, and since 
the causal graph is acyclic, no cyclic definitions occur. It remains to show that, for a macro 
m in II and a sequence U m preceding m in II, it holds that (pre (l~I m ) © post(U m ))'\/ 'pre(m) . 
Note that due to Lemmas A. 3 and A. 4 it is enough to show that 

(a) {pre" ®(v = \-x) © post(U™ 1 )) Vpre(m^), 

(b) (pre" (B(v = 1- x) © post(U a ))Vpre(a), 

(c) (pre" ®(v = l — x) ©post(iIo i ))Vpre(mQ l ). 

Case (a) follows easily since V post ^ t ^ ny^^ij = and pre(m^ 1 ) = pre Wl ®(wi = 0) C 
pre 1 '. Case (c) is similar, although this time we must use that post(rf^ l )(u> j) = 1 and 
post (IIq *)(?/;,,•) = for j < i, as required by pre^m^ 1 ) = pre Wi © (lUj = 1). Finally, case (b) 
holds because a variable u £ V pre ( a ) can be either u = v, which is covered by (v = 1 — x), 
splitting or static, which is covered by pre" , or symmetrically reversible, which is covered 
by pre"(u) = if pre(a)(u) = 0, and by post(Jl a )(u) = 1 if pre(a)(u) = 1. 

In remains to show that m x is a 3S-macro. It follows from Lemmas A. 3 and A. 5 that it 
is well-defined and it satisfies pre(m x ) = pre(LI) C pre" © (v = 1 — x). Finally, post(m x ) = 
postiji) — pre(U) = (v = x) is a direct consequence of post (IT) = (w\ = 0, . . . , Wk = 0, v = x) 
from Lemma A. 2, and pre(H)(wi) = 0, pre(H)(v) = 1 — x from the proof of Lemma A. 3. 
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